NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

SUPER-Net: Trustworthy image segmentation via uncertainty propagation in encoder-decoder networks

https://doi.org/10.1016/j.patcog.2025.112503

Carannante, Giuseppina; Bouaynaya, Nidhal C; Dera, Dimah; Fathallah-Shaykh, Hassan M; Rasool, Ghulam (April 2026, Pattern Recognition)

Full Text Available
HONeYBEE: enabling scalable multimodal AI in oncology through foundation model-driven embeddings

https://doi.org/10.1038/s41746-025-02003-4

Tripathi, Aakash; Waqas, Asim; Schabath, Matthew B; Yilmaz, Yasin; Rasool, Ghulam (December 2025, npj Digital Medicine)

Full Text Available
Improving Radiology Report Conciseness and Structure via Local Large Language Models

https://doi.org/10.1007/s10278-025-01510-w

Hartsock, Iryna; Araujo, Cyrillo; Folio, Les; Rasool, Ghulam (April 2025, Journal of Imaging Informatics in Medicine)

Full Text Available
Using consensus-based reasoning and large language models to extract structured data from surgical pathology reports

https://doi.org/10.1016/j.labinv.2025.104272

Tripathi, Aakash; Waqas, Asim; Venkatesan, Kavya; Ullah, Ehsan; Khan, Asma; Khalil, Farah; Chen, Wei-Shen; Ozturk, Zarifa Gahramanli; Saeed-Vafa, Daryoush; Bui, Marilyn M; et al (December 2025, Laboratory Investigation)

Full Text Available
Privacy-preserving Federated Learning and Uncertainty Quantification in Medical Imaging

https://doi.org/10.1148/ryai.240637

Koutsoubis, Nikolas; Waqas, Asim; Yilmaz, Yasin; Ramachandran, Ravi P; Schabath, Matthew B; Rasool, Ghulam (May 2025, Radiology: Artificial Intelligence)

Artificial Intelligence (AI) has demonstrated strong potential in automating medical imaging tasks, with potential applications across disease diagnosis, prognosis, treatment planning, and posttreatment surveillance. However, privacy concerns surrounding patient data remain a major barrier to the widespread adoption of AI in clinical practice, as large and diverse training datasets are essential for developing accurate, robust, and generalizable AI models. Federated Learning offers a privacy-preserving solution by enabling collaborative model training across institutions without sharing sensitive data. Instead, model parameters, such as model weights, are exchanged between participating sites. Despite its potential, federated learning is still in its early stages of development and faces several challenges. Notably, sensitive information can still be inferred from the shared model parameters. Additionally, postdeployment data distribution shifts can degrade model performance, making uncertainty quantification essential. In federated learning, this task is particularly challenging due to data heterogeneity across participating sites. This review provides a comprehensive overview of federated learning, privacy-preserving federated learning, and uncertainty quantification in federated learning. Key limitations in current methodologies are identified, and future research directions are proposed to enhance data privacy and trustworthiness in medical imaging applications
more » « less
Full Text Available
Vision-language models for medical report generation and visual question answering: a review

https://doi.org/10.3389/frai.2024.1430984

Hartsock, Iryna; Rasool, Ghulam (November 2024, Frontiers in Artificial Intelligence)

Medical vision-language models (VLMs) combine computer vision (CV) and natural language processing (NLP) to analyze visual and textual medical data. Our paper reviews recent advancements in developing VLMs specialized for healthcare, focusing on publicly available models designed for medical report generation and visual question answering (VQA). We provide background on NLP and CV, explaining how techniques from both fields are integrated into VLMs, with visual and language data often fused using Transformer-based architectures to enable effective learning from multimodal data. Key areas we address include the exploration of 18 public medical vision-language datasets, in-depth analyses of the architectures and pre-training strategies of 16 recent noteworthy medical VLMs, and comprehensive discussion on evaluation metrics for assessing VLMs' performance in medical report generation and VQA. We also highlight current challenges facing medical VLM development, including limited data availability, concerns with data privacy, and lack of proper evaluation metrics, among others, while also proposing future directions to address these obstacles. Overall, our review summarizes the recent progress in developing VLMs to harness multimodal medical data for improved healthcare applications.
more » « less
Full Text Available
Self-Normalizing Multi-Omics Neural Network for Pan-Cancer Prognostication

https://doi.org/10.3390/ijms26157358

Waqas, Asim; Tripathi, Aakash; Ahmed, Sabeen; Mukund, Ashwin; Farooq, Hamza; Johnson, Joseph O; Stewart, Paul A; Naeini, Mia; Schabath, Matthew B; Rasool, Ghulam (August 2025, International Journal of Molecular Sciences)

Full Text Available
BaSIS-Net: From Point Estimate to Predictive Distribution in Neural Networks - A Bayesian Sequential Importance Sampling Framework

Carannante, Giuseppina; Bouaynaya, Nidhal; Rasool, Ghulam; Mihaylova, Lyudmila (September 2024, Transactions on Machine Learning Research)

Data-driven Deep Learning (DL) models have revolutionized autonomous systems, but ensuring their safety and reliability necessitates the assessment of predictive confidence or uncertainty. Bayesian DL provides a principled approach to quantify uncertainty via probability density functions defined over model parameters. However, the exact solution is intractable for most DL models, and the approximation methods, often based on heuristics, suffer from scalability issues and stringent distribution assumptions and may lack theoretical guarantees. This work develops a Sequential Importance Sampling framework that approximates the posterior probability density function through weighted samples (or particles), which can be used to find the mean, variance, or higher-order moments of the posterior distribution. We demonstrate that propagating particles, which capture information about the higher-order moments, through the layers of the DL model results in increased robustness to natural and malicious noise (adversarial attacks). The variance computed from these particles effectively quantifies the model’s decision uncertainty, demonstrating well-calibrated and accurate predictive confidence.
more » « less
Full Text Available
Employing Consensus-Based Reasoning with Locally Deployed LLMs for Enabling Structured Data Extraction from Surgical Pathology Reports

https://doi.org/10.1101/2025.04.22.25326217

Tripathi, Aaksh; Waqas, Asim; Venkatesan, Kavya; Ullah, Ehsan; Khan, Asma; Khalil, Farah; Chen, Wei-Shen; Ozturk, Zarifa Gahramanli; Saeed-Vafa, Daryoush; Bui, Marilyn M; et al (April 2025, medRxiv)

Surgical pathology reports contain essential diagnostic information, in free-text form, required for cancer staging, treatment planning, and cancer registry documentation. However, their unstructured nature and variability across tumor types and institutions pose challenges for automated data extraction. We present a consensus-driven, reasoning-based framework that uses multiple locally deployed large language models (LLMs) to extract six key diagnostic variables: site, laterality, histology, stage, grade, and behavior. Each LLM produces structured outputs with accompanying justifications, which are evaluated for accuracy and coherence by a separate reasoning model. Final consensus values are determined through aggregation, and expert validation is conducted by board-certified or equivalent pathologists. The framework was applied to over 4,000 pathology reports from The Cancer Genome Atlas (TCGA) and Moffitt Cancer Center. Expert review confirmed high agreement in the TCGA dataset for behavior (100.0%), histology (98.5%), site (95.2%), and grade (95.6%), with lower performance for stage (87.6%) and laterality (84.8%). In the pathology reports from Moffitt (brain, breast, and lung), accuracy remained high across variables, with histology (95.6%), behavior (98.3%), and stage (92.4%), achieving strong agreement. However, certain challenges emerged, such as inconsistent mention of sentinel lymph node details or anatomical ambiguity in biopsy site interpretations. Statistical analyses revealed significant main effects of model type, variable, and organ system, as well as model × variable × organ interactions, emphasizing the role of clinical context in model performance. These results highlight the importance of stratified, multi-organ evaluation frameworks in LLM benchmarking for clinical applications. Textual justifications enhanced interpretability and enabled human reviewers to audit model outputs. Overall, this consensus-based approach demonstrates that locally deployed LLMs can provide a transparent, accurate, and auditable solution for integrating AI-driven data extraction into real-world pathology workflows, including cancer registry abstraction and synoptic reporting.
more » « less
Full Text Available
BaSIS-Net: From Point Estimate to Predictive Distribution in Neural Networks - A Bayesian Sequential Importance Sampling Framework

Carannante, Giuseppina; Bouaynaya, Nidhal C; Mihaylova, Lyudmila S; Rasool, Ghulam (July 2024, Transactions on Machine Learning Research)

Full Text Available

« Prev Next »

Search for: All records